[QNN EP] Fix 16x16 MatMul translation #24846
Conversation
/azp run Linux QNN CI Pipeline,Win_TRT_Minimal_CUDA_Test_CI,Windows ARM64 QNN CI Pipeline,Windows GPU Doc Gen CI Pipeline,Windows x64 QNN CI Pipeline
Azure Pipelines successfully started running 5 pipeline(s).
Analysis of the failures reported in the CI pipeline: we only enabled HTP tests in the Linux environment, but that environment uses the QNN simulator, not a real device.
/azp run Linux QNN CI Pipeline,Win_TRT_Minimal_CUDA_Test_CI,Windows ARM64 QNN CI Pipeline,Windows GPU Doc Gen CI Pipeline,Windows x64 QNN CI Pipeline
Azure Pipelines successfully started running 5 pipeline(s).
@HectorSVC Thanks in advance.
There was a fix for the Web CI pipeline; please merge the code from the latest main branch.
- QNN's 16x16 FC doesn't support an asymmetric int16 weight.
- QNN's 16x16 MatMul doesn't support an asymmetric int16 weight initializer.
- Insert a Convert op to convert the asymmetric uint16 weight to a symmetric int16 weight.
- Add unit tests to verify 16x16 MatMul translations.
- MatMul 16x16 is supported only on some hardware.
- Disable the MatMul 16x16 unit tests on Linux platforms.
Force-pushed from aae6e64 to 698f087
@HectorSVC Thanks for the input. I merged the code from the latest main branch. Could you please re-trigger the CI pipeline?
/azp run Linux QNN CI Pipeline,Win_TRT_Minimal_CUDA_Test_CI,Windows ARM64 QNN CI Pipeline,Windows GPU Doc Gen CI Pipeline,Windows x64 QNN CI Pipeline
Azure Pipelines successfully started running 5 pipeline(s).
### Description

- QNN's 16x16 FC doesn't support an asymmetric int16 weight.
- QNN's 16x16 MatMul doesn't support an asymmetric int16 weight initializer.
- Insert a Convert op to convert the asymmetric uint16 weight to a symmetric int16 weight.
- Add unit tests to verify 16x16 MatMul translations.

### Motivation and Context

- This fix schedules 16x16 MatMul ops on the QNN HTP accelerator.
- This improves the inference time of models containing 16x16 MatMul operators.
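Numerically, inserting a Convert op here amounts to requantizing the weight: an asymmetric uint16 tensor (scale `s`, zero point `zp`) is mapped to a symmetric int16 tensor (same scale, zero point forced to 0). Below is a minimal numpy sketch of that mapping; the function name and the use of numpy are illustrative only and are not the QNN EP's actual implementation:

```python
import numpy as np

def convert_u16_asym_to_s16_sym(q_u16, scale, zero_point):
    # Hypothetical helper, not QNN EP code: dequantize with the original
    # asymmetric uint16 params, then requantize symmetrically as int16.
    real = scale * (q_u16.astype(np.int64) - zero_point)
    # Symmetric int16 quantization: zero point is implicitly 0, so the
    # quantized value is just real / scale, clipped to the int16 range.
    q_s16 = np.clip(np.round(real / scale), -32768, 32767).astype(np.int16)
    return q_s16, scale

# When the uint16 zero point is exactly 32768, the conversion reduces to
# subtracting 32768 from every element, with no precision loss.
w_u16 = np.array([0, 1, 32768, 65535], dtype=np.uint16)
w_s16, s = convert_u16_asym_to_s16_sym(w_u16, 0.001, 32768)
```

Note that when the zero point is not centered (zp != 32768), part of the uint16 range maps outside int16 and gets clipped, which is one reason a dedicated Convert op (rather than a raw reinterpret) is needed.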